Toponym Resolution: A First Large-Scale Comparative Evaluation

نویسنده

  • Jochen L. Leidner
چکیده

Toponym resolution (TR) is the task of mapping the name of a location to a spatial representation of the location referred to, such as the centroid of the location, given as latitude/longitude. While a number of systems for automating the task have been described in the literature, to date no comparative evaluation study has existed, mainly for lack of a standard benchmark (i.e., gazetteer and evaluation corpus). On the basis of a benchmark methodology and dataset, we present the first systematic account of the utility of different heuristics for the toponym resolution task, based on experimental comparison on two novel, large-scale gold-standard corpora. Each heuristic’s utility is evaluated in isolation, and in addition, two previously reported complex methods are replicated in full.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards a Reference Corpus for Automatic Toponym Resolution Evaluation

Spatial named entities ground events in space, and this relationship is essential for advanced text processing applications such as question answering and event tracking. Toponym resolution is the task of mapping from an entity to a spatial representation (an extensional coordinate model), given the context. Whereas work on the temporal dimension is ongoing [17], to date no reference corpus exi...

متن کامل

Creating a Novel Geolocation Corpus from Historical Texts

This paper describes the process of annotating a historical US civil war corpus with geographic reference. Reference annotations are given at two different textual scales: individual place names and documents. This is the first published corpus of its kind in document-level geolocation, and it has over 10,000 disambiguated toponyms, double the amount of any prior toponym corpus. We outline many...

متن کامل

Comparative Evaluation of Image Fusion Methods for Hyperspectral and Panchromatic Data Fusion in Agricultural and Urban Areas

Nowadays remote sensing plays a key role in the field of earth science studies due to some of the advantages, including data collection at a very low cost and time on a very large scale. Meanwhile, using hyperspectral data is of great importance due to the high spectral resolution. Because of some limitations, such as hyperspectral imaging technology, it suffers from a reduction in the spatial ...

متن کامل

Exploring Probabilistic Toponym Resolution for Geographical Information Retrieval

A key problem that arises when unstructured text is being queried is that of properly recognizing and exploiting geographical terms and entities. Here we describe a mechanism for probabilistic toponym resolution, and our experiments with the new method in the setting of the 2005 GeoCLEF queries and judgments. The new method gives improved retrieval effectiveness on a subset of the topics.

متن کامل

Toponym Resolution in Text: “Which Sheffield is it?”

Named entity tagging comprises the sub-tasks of identifying a text span and classifying it, but this view ignores the relationship between the entities and the world. Spatial and temporal entities ground events in space-time, and this relationship is vital for applications such as question answering and event tracking. There is much recent work regarding the temporal dimension [13, 10], but no ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006